Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 1048575 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 88.0 MiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 3 |
isFlaggedFraud has constant value "" | Constant |
nameOrig has a high cardinality: 1048317 distinct values | High cardinality |
nameDest has a high cardinality: 449635 distinct values | High cardinality |
amount is highly overall correlated with oldbalanceDest and 1 other fields | High correlation |
oldbalanceOrg is highly overall correlated with newbalanceOrig | High correlation |
newbalanceOrig is highly overall correlated with oldbalanceOrg | High correlation |
oldbalanceDest is highly overall correlated with amount and 1 other fields | High correlation |
newbalanceDest is highly overall correlated with amount and 1 other fields | High correlation |
isFraud is highly skewed (γ1 = 30.2521979) | Skewed |
nameOrig is uniformly distributed | Uniform |
oldbalanceOrg has 342214 (32.6%) zeros | Zeros |
newbalanceOrig has 580275 (55.3%) zeros | Zeros |
oldbalanceDest has 437134 (41.7%) zeros | Zeros |
newbalanceDest has 406914 (38.8%) zeros | Zeros |
isFraud has 1047433 (99.9%) zeros | Zeros |
isFlaggedFraud has 1048575 (100.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-04-20 15:36:57.181278 |
|---|---|
| Analysis finished | 2023-04-20 15:40:06.719225 |
| Duration | 3 minutes and 9.54 seconds |
| Software version | ydata-profiling vv4.0.0 |
| Download configuration | config.json |
step
Real number (ℝ)
| Distinct | 95 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.966174 |
| Minimum | 1 |
|---|---|
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 15 |
| median | 20 |
| Q3 | 39 |
| 95-th percentile | 45 |
| Maximum | 95 |
| Range | 94 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 15.623252 |
|---|---|
| Coefficient of variation (CV) | 0.57936478 |
| Kurtosis | 3.4335532 |
| Mean | 26.966174 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 1.2944546 |
| Sum | 28276056 |
| Variance | 244.08599 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 19 | 51352 | 4.9% |
| 18 | 49579 | 4.7% |
| 43 | 45060 | 4.3% |
| 15 | 44609 | 4.3% |
| 17 | 43361 | 4.1% |
| 16 | 42471 | 4.1% |
| 14 | 41485 | 4.0% |
| 42 | 41304 | 3.9% |
| 20 | 40625 | 3.9% |
| 36 | 39774 | 3.8% |
| Other values (85) | 608955 |
| Value | Count | Frequency (%) |
| 1 | 2708 | 0.3% |
| 2 | 1014 | 0.1% |
| 3 | 552 | 0.1% |
| 4 | 565 | 0.1% |
| 5 | 665 | 0.1% |
| 6 | 1660 | 0.2% |
| 7 | 6837 | 0.7% |
| 8 | 21097 | |
| 9 | 37628 | |
| 10 | 35991 |
| Value | Count | Frequency (%) |
| 95 | 2980 | 0.3% |
| 94 | 10372 | |
| 93 | 4444 | |
| 92 | 10 | < 0.1% |
| 91 | 8 | < 0.1% |
| 90 | 16 | < 0.1% |
| 89 | 6 | < 0.1% |
| 88 | 8 | < 0.1% |
| 87 | 6 | < 0.1% |
| 86 | 18 | < 0.1% |
type
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.0 MiB |
| CASH_OUT | |
|---|---|
| PAYMENT | |
| CASH_IN | |
| TRANSFER | |
| DEBIT | 7178 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.4253754 |
| Min length | 5 |
Characters and Unicode
| Total characters | 7786063 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAYMENT |
|---|---|
| 2nd row | PAYMENT |
| 3rd row | TRANSFER |
| 4th row | CASH_OUT |
| 5th row | PAYMENT |
Common Values
| Value | Count | Frequency (%) |
| CASH_OUT | 373641 | |
| PAYMENT | 353873 | |
| CASH_IN | 227130 | |
| TRANSFER | 86753 | 8.3% |
| DEBIT | 7178 | 0.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cash_out | 373641 | |
| payment | 353873 | |
| cash_in | 227130 | |
| transfer | 86753 | 8.3% |
| debit | 7178 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1041397 | |
| T | 821445 | |
| S | 687524 | |
| N | 667756 | |
| C | 600771 | 7.7% |
| H | 600771 | 7.7% |
| _ | 600771 | 7.7% |
| E | 447804 | 5.8% |
| O | 373641 | 4.8% |
| U | 373641 | 4.8% |
| Other values (8) | 1570542 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7185292 | |
| Connector Punctuation | 600771 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1041397 | |
| T | 821445 | |
| S | 687524 | |
| N | 667756 | |
| C | 600771 | |
| H | 600771 | |
| E | 447804 | 6.2% |
| O | 373641 | 5.2% |
| U | 373641 | 5.2% |
| Y | 353873 | 4.9% |
| Other values (7) | 1216669 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 600771 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7185292 | |
| Common | 600771 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1041397 | |
| T | 821445 | |
| S | 687524 | |
| N | 667756 | |
| C | 600771 | |
| H | 600771 | |
| E | 447804 | 6.2% |
| O | 373641 | 5.2% |
| U | 373641 | 5.2% |
| Y | 353873 | 4.9% |
| Other values (7) | 1216669 |
Common
| Value | Count | Frequency (%) |
| _ | 600771 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7786063 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1041397 | |
| T | 821445 | |
| S | 687524 | |
| N | 667756 | |
| C | 600771 | 7.7% |
| H | 600771 | 7.7% |
| _ | 600771 | 7.7% |
| E | 447804 | 5.8% |
| O | 373641 | 4.8% |
| U | 373641 | 4.8% |
| Other values (8) | 1570542 |
amount
Real number (ℝ)
| Distinct | 1009606 |
|---|---|
| Distinct (%) | 96.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 158666.98 |
| Minimum | 0.1 |
|---|---|
| Maximum | 10000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 2097.002 |
| Q1 | 12149.065 |
| median | 76343.33 |
| Q3 | 213761.89 |
| 95-th percentile | 519460.35 |
| Maximum | 10000000 |
| Range | 9999999.9 |
| Interquartile range (IQR) | 201612.83 |
Descriptive statistics
| Standard deviation | 264940.93 |
|---|---|
| Coefficient of variation (CV) | 1.6697925 |
| Kurtosis | 96.805427 |
| Mean | 158666.98 |
| Median Absolute Deviation (MAD) | 70322.84 |
| Skewness | 6.3741657 |
| Sum | 1.6637422 × 1011 |
| Variance | 7.0193697 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000000 | 14 | < 0.1% |
| 706.25 | 6 | < 0.1% |
| 1711.67 | 5 | < 0.1% |
| 3172.71 | 5 | < 0.1% |
| 5838.16 | 5 | < 0.1% |
| 9217.19 | 5 | < 0.1% |
| 3279.19 | 5 | < 0.1% |
| 3216.8 | 5 | < 0.1% |
| 2432.1 | 5 | < 0.1% |
| 5909.55 | 5 | < 0.1% |
| Other values (1009596) | 1048515 |
| Value | Count | Frequency (%) |
| 0.1 | 1 | |
| 0.14 | 1 | |
| 0.2 | 1 | |
| 0.26 | 1 | |
| 0.3 | 1 | |
| 0.32 | 1 | |
| 0.37 | 1 | |
| 0.5 | 1 | |
| 0.52 | 1 | |
| 0.63 | 2 |
| Value | Count | Frequency (%) |
| 10000000 | 14 | |
| 9977761.05 | 2 | < 0.1% |
| 9887819.06 | 2 | < 0.1% |
| 9465988.82 | 2 | < 0.1% |
| 9345700.07 | 2 | < 0.1% |
| 9039246.82 | 2 | < 0.1% |
| 8931607.89 | 2 | < 0.1% |
| 8924971.59 | 2 | < 0.1% |
| 8594065.09 | 2 | < 0.1% |
| 7937954.2 | 2 | < 0.1% |
nameOrig
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1048317 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.0 MiB |
| C1214450722 | 2 |
|---|---|
| C309111136 | 2 |
| C1268675361 | 2 |
| C720460198 | 2 |
| C1109092856 | 2 |
| Other values (1048312) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.482179 |
| Min length | 5 |
Characters and Unicode
| Total characters | 10991351 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1048059 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | C1231006815 |
|---|---|
| 2nd row | C1666544295 |
| 3rd row | C1305486145 |
| 4th row | C840083671 |
| 5th row | C2048537720 |
Common Values
| Value | Count | Frequency (%) |
| C1214450722 | 2 | < 0.1% |
| C309111136 | 2 | < 0.1% |
| C1268675361 | 2 | < 0.1% |
| C720460198 | 2 | < 0.1% |
| C1109092856 | 2 | < 0.1% |
| C545402485 | 2 | < 0.1% |
| C1362689728 | 2 | < 0.1% |
| C110179857 | 2 | < 0.1% |
| C1467095135 | 2 | < 0.1% |
| C2073023524 | 2 | < 0.1% |
| Other values (1048307) | 1048555 |
Length
| Value | Count | Frequency (%) |
| c1214450722 | 2 | < 0.1% |
| c443816828 | 2 | < 0.1% |
| c645536800 | 2 | < 0.1% |
| c301090204 | 2 | < 0.1% |
| c563955235 | 2 | < 0.1% |
| c150158085 | 2 | < 0.1% |
| c2038530463 | 2 | < 0.1% |
| c1378765159 | 2 | < 0.1% |
| c556791598 | 2 | < 0.1% |
| c263263252 | 2 | < 0.1% |
| Other values (1048307) | 1048555 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1450056 | |
| C | 1048575 | |
| 2 | 1011021 | |
| 3 | 939627 | |
| 4 | 938760 | |
| 6 | 935202 | |
| 5 | 935188 | |
| 0 | 933624 | |
| 7 | 933248 | |
| 9 | 933108 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9942776 | |
| Uppercase Letter | 1048575 | 9.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1450056 | |
| 2 | 1011021 | |
| 3 | 939627 | |
| 4 | 938760 | |
| 6 | 935202 | |
| 5 | 935188 | |
| 0 | 933624 | |
| 7 | 933248 | |
| 9 | 933108 | |
| 8 | 932942 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1048575 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9942776 | |
| Latin | 1048575 | 9.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1450056 | |
| 2 | 1011021 | |
| 3 | 939627 | |
| 4 | 938760 | |
| 6 | 935202 | |
| 5 | 935188 | |
| 0 | 933624 | |
| 7 | 933248 | |
| 9 | 933108 | |
| 8 | 932942 |
Latin
| Value | Count | Frequency (%) |
| C | 1048575 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10991351 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1450056 | |
| C | 1048575 | |
| 2 | 1011021 | |
| 3 | 939627 | |
| 4 | 938760 | |
| 6 | 935202 | |
| 5 | 935188 | |
| 0 | 933624 | |
| 7 | 933248 | |
| 9 | 933108 |
oldbalanceOrg
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 391033 |
|---|---|
| Distinct (%) | 37.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 874009.54 |
| Minimum | 0 |
|---|---|
| Maximum | 38900000 |
| Zeros | 342214 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 16002 |
| Q3 | 136642.02 |
| 95-th percentile | 6007521 |
| Maximum | 38900000 |
| Range | 38900000 |
| Interquartile range (IQR) | 136642.02 |
Descriptive statistics
| Standard deviation | 2971750.6 |
|---|---|
| Coefficient of variation (CV) | 3.4001351 |
| Kurtosis | 30.877779 |
| Mean | 874009.54 |
| Median Absolute Deviation (MAD) | 16002 |
| Skewness | 5.1242857 |
| Sum | 9.1646456 × 1011 |
| Variance | 8.8313014 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 342214 | |
| 10100000 | 433 | < 0.1% |
| 10300000 | 424 | < 0.1% |
| 10200000 | 421 | < 0.1% |
| 10900000 | 387 | < 0.1% |
| 10400000 | 379 | < 0.1% |
| 10700000 | 378 | < 0.1% |
| 10600000 | 376 | < 0.1% |
| 10500000 | 375 | < 0.1% |
| 11000000 | 337 | < 0.1% |
| Other values (391023) | 702851 |
| Value | Count | Frequency (%) |
| 0 | 342214 | |
| 0.67 | 1 | < 0.1% |
| 1 | 57 | < 0.1% |
| 1.7 | 1 | < 0.1% |
| 2 | 51 | < 0.1% |
| 2.36 | 1 | < 0.1% |
| 3 | 53 | < 0.1% |
| 4 | 54 | < 0.1% |
| 4.58 | 1 | < 0.1% |
| 4.98 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 38900000 | 1 | |
| 38600000 | 1 | |
| 38400000 | 2 | |
| 38300000 | 1 | |
| 38200000 | 1 | |
| 38000000 | 1 | |
| 37900000 | 1 | |
| 37500000 | 1 | |
| 37300000 | 1 | |
| 36700000 | 1 |
newbalanceOrig
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 440792 |
|---|---|
| Distinct (%) | 42.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 893808.9 |
| Minimum | 0 |
|---|---|
| Maximum | 38900000 |
| Zeros | 580275 |
| Zeros (%) | 55.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 174599.99 |
| 95-th percentile | 6168354.9 |
| Maximum | 38900000 |
| Range | 38900000 |
| Interquartile range (IQR) | 174599.99 |
Descriptive statistics
| Standard deviation | 3008271.3 |
|---|---|
| Coefficient of variation (CV) | 3.3656762 |
| Kurtosis | 30.139652 |
| Mean | 893808.9 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.0604564 |
| Sum | 9.3722567 × 1011 |
| Variance | 9.0496964 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 580275 | |
| 10300000 | 450 | < 0.1% |
| 10100000 | 449 | < 0.1% |
| 10200000 | 435 | < 0.1% |
| 10900000 | 405 | < 0.1% |
| 10700000 | 400 | < 0.1% |
| 10400000 | 399 | < 0.1% |
| 10600000 | 391 | < 0.1% |
| 10500000 | 388 | < 0.1% |
| 11000000 | 356 | < 0.1% |
| Other values (440782) | 464627 |
| Value | Count | Frequency (%) |
| 0 | 580275 | |
| 0.67 | 1 | < 0.1% |
| 0.73 | 1 | < 0.1% |
| 1.17 | 1 | < 0.1% |
| 1.48 | 1 | < 0.1% |
| 1.52 | 1 | < 0.1% |
| 1.63 | 1 | < 0.1% |
| 1.7 | 1 | < 0.1% |
| 1.78 | 1 | < 0.1% |
| 2.01 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 38900000 | 2 | |
| 38600000 | 1 | |
| 38400000 | 2 | |
| 38300000 | 1 | |
| 38200000 | 1 | |
| 38000000 | 1 | |
| 37900000 | 1 | |
| 37500000 | 1 | |
| 37300000 | 1 | |
| 36700000 | 1 |
nameDest
Categorical
| Distinct | 449635 |
|---|---|
| Distinct (%) | 42.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.0 MiB |
| C985934102 | 98 |
|---|---|
| C1286084959 | 96 |
| C1590550415 | 89 |
| C248609774 | 88 |
| C665576141 | 87 |
| Other values (449630) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.479478 |
| Min length | 4 |
Characters and Unicode
| Total characters | 10988519 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 371364 ? |
|---|---|
| Unique (%) | 35.4% |
Sample
| 1st row | M1979787155 |
|---|---|
| 2nd row | M2044282225 |
| 3rd row | C553264065 |
| 4th row | C38997010 |
| 5th row | M1230701703 |
Common Values
| Value | Count | Frequency (%) |
| C985934102 | 98 | < 0.1% |
| C1286084959 | 96 | < 0.1% |
| C1590550415 | 89 | < 0.1% |
| C248609774 | 88 | < 0.1% |
| C665576141 | 87 | < 0.1% |
| C2083562754 | 86 | < 0.1% |
| C977993101 | 82 | < 0.1% |
| C1360767589 | 81 | < 0.1% |
| C451111351 | 80 | < 0.1% |
| C306206744 | 79 | < 0.1% |
| Other values (449625) | 1047709 |
Length
| Value | Count | Frequency (%) |
| c985934102 | 98 | < 0.1% |
| c1286084959 | 96 | < 0.1% |
| c1590550415 | 89 | < 0.1% |
| c248609774 | 88 | < 0.1% |
| c665576141 | 87 | < 0.1% |
| c2083562754 | 86 | < 0.1% |
| c977993101 | 82 | < 0.1% |
| c1360767589 | 81 | < 0.1% |
| c451111351 | 80 | < 0.1% |
| c306206744 | 79 | < 0.1% |
| Other values (449625) | 1047709 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1446578 | |
| 2 | 1011948 | |
| 8 | 939332 | |
| 3 | 938299 | |
| 4 | 937381 | |
| 0 | 935604 | |
| 7 | 934680 | |
| 6 | 932589 | |
| 5 | 932032 | |
| 9 | 931501 | |
| Other values (2) | 1048575 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9939944 | |
| Uppercase Letter | 1048575 | 9.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1446578 | |
| 2 | 1011948 | |
| 8 | 939332 | |
| 3 | 938299 | |
| 4 | 937381 | |
| 0 | 935604 | |
| 7 | 934680 | |
| 6 | 932589 | |
| 5 | 932032 | |
| 9 | 931501 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 694702 | |
| M | 353873 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9939944 | |
| Latin | 1048575 | 9.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1446578 | |
| 2 | 1011948 | |
| 8 | 939332 | |
| 3 | 938299 | |
| 4 | 937381 | |
| 0 | 935604 | |
| 7 | 934680 | |
| 6 | 932589 | |
| 5 | 932032 | |
| 9 | 931501 |
Latin
| Value | Count | Frequency (%) |
| C | 694702 | |
| M | 353873 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10988519 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1446578 | |
| 2 | 1011948 | |
| 8 | 939332 | |
| 3 | 938299 | |
| 4 | 937381 | |
| 0 | 935604 | |
| 7 | 934680 | |
| 6 | 932589 | |
| 5 | 932032 | |
| 9 | 931501 | |
| Other values (2) | 1048575 |
oldbalanceDest
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 590110 |
|---|---|
| Distinct (%) | 56.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 978160.05 |
| Minimum | 0 |
|---|---|
| Maximum | 42100000 |
| Zeros | 437134 |
| Zeros (%) | 41.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 126377.21 |
| Q3 | 915923.47 |
| 95-th percentile | 4686136.4 |
| Maximum | 42100000 |
| Range | 42100000 |
| Interquartile range (IQR) | 915923.47 |
Descriptive statistics
| Standard deviation | 2296780.4 |
|---|---|
| Coefficient of variation (CV) | 2.3480619 |
| Kurtosis | 42.638314 |
| Mean | 978160.05 |
| Median Absolute Deviation (MAD) | 126377.21 |
| Skewness | 5.3731949 |
| Sum | 1.0256742 × 1012 |
| Variance | 5.2752002 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 437134 | |
| 10100000 | 314 | < 0.1% |
| 10300000 | 304 | < 0.1% |
| 10200000 | 295 | < 0.1% |
| 10900000 | 295 | < 0.1% |
| 10800000 | 291 | < 0.1% |
| 10500000 | 291 | < 0.1% |
| 10600000 | 284 | < 0.1% |
| 10700000 | 276 | < 0.1% |
| 10400000 | 265 | < 0.1% |
| Other values (590100) | 608826 |
| Value | Count | Frequency (%) |
| 0 | 437134 | |
| 0.37 | 1 | < 0.1% |
| 1 | 6 | < 0.1% |
| 2 | 7 | < 0.1% |
| 2.94 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 3.11 | 1 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 42100000 | 1 | < 0.1% |
| 41500000 | 1 | < 0.1% |
| 41400000 | 2 | |
| 41300000 | 4 | |
| 41100000 | 3 | |
| 41000000 | 1 | < 0.1% |
| 40900000 | 1 | < 0.1% |
| 39900000 | 1 | < 0.1% |
| 39000000 | 3 | |
| 38900000 | 1 | < 0.1% |
newbalanceDest
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 437054 |
|---|---|
| Distinct (%) | 41.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1114198 |
| Minimum | 0 |
|---|---|
| Maximum | 42200000 |
| Zeros | 406914 |
| Zeros (%) | 38.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 218260.36 |
| Q3 | 1149807.5 |
| 95-th percentile | 5140096.1 |
| Maximum | 42200000 |
| Range | 42200000 |
| Interquartile range (IQR) | 1149807.5 |
Descriptive statistics
| Standard deviation | 2416593.1 |
|---|---|
| Coefficient of variation (CV) | 2.1689082 |
| Kurtosis | 37.4222 |
| Mean | 1114198 |
| Median Absolute Deviation (MAD) | 218260.36 |
| Skewness | 5.0124557 |
| Sum | 1.1683201 × 1012 |
| Variance | 5.8399223 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 406914 | |
| 10200000 | 361 | < 0.1% |
| 10900000 | 350 | < 0.1% |
| 10500000 | 348 | < 0.1% |
| 10100000 | 343 | < 0.1% |
| 10400000 | 327 | < 0.1% |
| 10300000 | 324 | < 0.1% |
| 10800000 | 310 | < 0.1% |
| 11000000 | 304 | < 0.1% |
| 10600000 | 292 | < 0.1% |
| Other values (437044) | 638702 |
| Value | Count | Frequency (%) |
| 0 | 406914 | |
| 0.33 | 1 | < 0.1% |
| 2.94 | 1 | < 0.1% |
| 3.11 | 1 | < 0.1% |
| 7.7 | 1 | < 0.1% |
| 9.69 | 1 | < 0.1% |
| 10.98 | 1 | < 0.1% |
| 12.1 | 4 | < 0.1% |
| 12.82 | 6 | < 0.1% |
| 13.47 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 42200000 | 1 | < 0.1% |
| 42100000 | 1 | < 0.1% |
| 41500000 | 1 | < 0.1% |
| 41400000 | 3 | |
| 41300000 | 5 | |
| 41100000 | 2 | < 0.1% |
| 40900000 | 1 | < 0.1% |
| 39900000 | 2 | < 0.1% |
| 39000000 | 2 | < 0.1% |
| 38900000 | 1 | < 0.1% |
isFraud
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0010890971 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 1047433 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.032983511 |
|---|---|
| Coefficient of variation (CV) | 30.285189 |
| Kurtosis | 913.19722 |
| Mean | 0.0010890971 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 30.252198 |
| Sum | 1142 |
| Variance | 0.001087912 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1047433 | |
| 1 | 1142 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1047433 | |
| 1 | 1142 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1142 | 0.1% |
| 0 | 1047433 |
isFlaggedFraud
Real number (ℝ)
CONSTANT  ZEROS 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0 |
| Minimum | 0 |
|---|---|
| Maximum | 0 |
| Zeros | 1048575 |
| Zeros (%) | 100.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 0 |
| Range | 0 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0 |
|---|---|
| Coefficient of variation (CV) | nan |
| Kurtosis | 0 |
| Mean | 0 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0 |
| Sum | 0 |
| Variance | 0 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 0 | 1048575 |
| Value | Count | Frequency (%) |
| 0 | 1048575 |
| Value | Count | Frequency (%) |
| 0 | 1048575 |
| step | amount | oldbalanceOrg | newbalanceOrig | oldbalanceDest | newbalanceDest | isFraud | type | |
|---|---|---|---|---|---|---|---|---|
| step | 1.000 | -0.036 | -0.024 | -0.022 | 0.007 | -0.002 | 0.026 | 0.048 |
| amount | -0.036 | 1.000 | 0.030 | -0.093 | 0.603 | 0.672 | 0.028 | 0.216 |
| oldbalanceOrg | -0.024 | 0.030 | 1.000 | 0.815 | 0.009 | -0.026 | 0.031 | 0.262 |
| newbalanceOrig | -0.022 | -0.093 | 0.815 | 1.000 | 0.022 | -0.113 | -0.027 | 0.266 |
| oldbalanceDest | 0.007 | 0.603 | 0.009 | 0.022 | 1.000 | 0.925 | -0.016 | 0.093 |
| newbalanceDest | -0.002 | 0.672 | -0.026 | -0.113 | 0.925 | 1.000 | -0.005 | 0.109 |
| isFraud | 0.026 | 0.028 | 0.031 | -0.027 | -0.016 | -0.005 | 1.000 | 0.054 |
| type | 0.048 | 0.216 | 0.262 | 0.266 | 0.093 | 0.109 | 0.054 | 1.000 |
| step | type | amount | nameOrig | oldbalanceOrg | newbalanceOrig | nameDest | oldbalanceDest | newbalanceDest | isFraud | isFlaggedFraud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | PAYMENT | 9839.64 | C1231006815 | 170136.00 | 160296.36 | M1979787155 | 0.0 | 0.00 | 0 | 0 |
| 1 | 1 | PAYMENT | 1864.28 | C1666544295 | 21249.00 | 19384.72 | M2044282225 | 0.0 | 0.00 | 0 | 0 |
| 2 | 1 | TRANSFER | 181.00 | C1305486145 | 181.00 | 0.00 | C553264065 | 0.0 | 0.00 | 1 | 0 |
| 3 | 1 | CASH_OUT | 181.00 | C840083671 | 181.00 | 0.00 | C38997010 | 21182.0 | 0.00 | 1 | 0 |
| 4 | 1 | PAYMENT | 11668.14 | C2048537720 | 41554.00 | 29885.86 | M1230701703 | 0.0 | 0.00 | 0 | 0 |
| 5 | 1 | PAYMENT | 7817.71 | C90045638 | 53860.00 | 46042.29 | M573487274 | 0.0 | 0.00 | 0 | 0 |
| 6 | 1 | PAYMENT | 7107.77 | C154988899 | 183195.00 | 176087.23 | M408069119 | 0.0 | 0.00 | 0 | 0 |
| 7 | 1 | PAYMENT | 7861.64 | C1912850431 | 176087.23 | 168225.59 | M633326333 | 0.0 | 0.00 | 0 | 0 |
| 8 | 1 | PAYMENT | 4024.36 | C1265012928 | 2671.00 | 0.00 | M1176932104 | 0.0 | 0.00 | 0 | 0 |
| 9 | 1 | DEBIT | 5337.77 | C712410124 | 41720.00 | 36382.23 | C195600860 | 41898.0 | 40348.79 | 0 | 0 |
| step | type | amount | nameOrig | oldbalanceOrg | newbalanceOrig | nameDest | oldbalanceDest | newbalanceDest | isFraud | isFlaggedFraud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1048565 | 95 | TRANSFER | 132387.24 | C1654402840 | 15956.51 | 0.00 | C1878219072 | 631284.08 | 763671.32 | 0 | 0 |
| 1048566 | 95 | PAYMENT | 12598.15 | C565523855 | 30601.00 | 18002.85 | M1740980642 | 0.00 | 0.00 | 0 | 0 |
| 1048567 | 95 | CASH_OUT | 279674.05 | C990252469 | 18002.85 | 0.00 | C574439165 | 1847488.28 | 2127162.32 | 0 | 0 |
| 1048568 | 95 | PAYMENT | 20721.54 | C954269986 | 49732.00 | 29010.46 | M812667644 | 0.00 | 0.00 | 0 | 0 |
| 1048569 | 95 | PAYMENT | 3210.11 | C2113264897 | 11113.00 | 7902.89 | M1989479599 | 0.00 | 0.00 | 0 | 0 |
| 1048570 | 95 | CASH_OUT | 132557.35 | C1179511630 | 479803.00 | 347245.65 | C435674507 | 484329.37 | 616886.72 | 0 | 0 |
| 1048571 | 95 | PAYMENT | 9917.36 | C1956161225 | 90545.00 | 80627.64 | M668364942 | 0.00 | 0.00 | 0 | 0 |
| 1048572 | 95 | PAYMENT | 14140.05 | C2037964975 | 20545.00 | 6404.95 | M1355182933 | 0.00 | 0.00 | 0 | 0 |
| 1048573 | 95 | PAYMENT | 10020.05 | C1633237354 | 90605.00 | 80584.95 | M1964992463 | 0.00 | 0.00 | 0 | 0 |
| 1048574 | 95 | PAYMENT | 11450.03 | C1264356443 | 80584.95 | 69134.92 | M677577406 | 0.00 | 0.00 | 0 | 0 |